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DETAILED ACTION 

Status of Application, Amendments and/or Claims 

The amendment of 12/6/06 has been entered in full. Claims 68-92 are canceled. 
New claims 93-1 17 are added. 

Claims 93-1 17 are under consideration in the instant application. 

The text of those sections of Title 35, U.S. Code not included in this action can 
be found in a prior Office action. 

In view of the new grounds of rejection, the finality of the previous Office Action is 
withdrawn. 

Withdrawn Objections and/or Rejections 

The following page numbers refer to the previous Office Action (9/6/2006). 
All rejections of claims 68-92 are moot in view of Applicants' cancellation of these 
claims. 

The rejection of claims 68-92 under 35 U.S.C. § 101 for lack of utility set forth at 
page 2-4 has not been applied to new claims 93-1 17 in view of Applicants' resubmission 
of the Affidavit pursuant to 37 CFR 1.132 and Figures 1-2 associated which has been 
fully considered by the Examiner and provides sufficient evidence of utility for the 
claimed nucleic acids. 

The rejection of claims 68-92 under 35 U.S.C. 112, first paragraph for lack of 
enablement set forth at pg 4-7 has not been applied to new claims 93-1 1 7 in view of (1 ) 
Applicants' resubmission of the Affidavit pursuant to 37 CFR 1 . 1 32 and Figures 1 -2 
associated therewith which have been found to be sufficient; and (2) in view of the 
different claim limitations presented in new claims 93-1 17. 

The rejection of claims 68-92 under 35 U.S.C. 112, first paragraph for lack of 
written description set forth at pg 7-10 has not been applied to new claims 93-1 17 in 
view of the different claim limitations presented in new claims 93-117. 

The additional grounds for rejection of claim 92 under 35 U.S.C. 112, first 
paragraph for lack of written description set forth at pg 9-10 is also withdrawn on further 
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consideration by the Examiner; therefore, this rejection has not been applied to new 
claim 117. On further consideration by the Examiner, the human T2R sequences 
described in the specification provide sufficient written description in order to identify the 
T2R nucleic acid sequences from other species that are encompassed by claim 117. 

The rejection of claims 68-73 under 35 U.S.C. § 102(a) as being anticipated by 
WO200257309-A1 (Miwa et al, published July 25, 2002) set forth at pg 12-14 has not 
been applied to new claims 93-1 1 7 in view of the combination of (1 ) Applicants' 
submission on 12/6/06 of a 131 Affidavit executed by all of the inventors except for Elliot 
Adler; and (2) Applicants' 131 submission on 9/3/2005 of a 131 Affidavit executed by 
Robin Teskin; this Affidavit has been considered effective in place of an Affidavit 
executed by Elliot Adler in view of the Petition under 37 C.F.R 1 .47(a) granted 
3/1 1/2004. It is noted that MPEP 715.04 [R-5] states, "The following parties may make 
an affidavit or declaration under 37 CFR 1.131 ...(c) If a petition under 37 CFR 1.47 was 
granted ... the affidavit or declaration may be signed by the 37 CFR 1 .47 applicant or 
the legal representative, where appropriate." However, please see the new rejection 
under 35 U.S.C § 102(a) set forth below. 



Note 

At page 6 of the 12/6/06 response, Applicants state "Applicants respectfully note 
that the PTOL-326 incorrectly states that this Office Action is a final rejection. In fact the 
text of the Office Action correctly notes that the Office Action is a non-final rejection 
based on the RCE Request." 

In response, the Examiner notes that the 9/6/06 Office Action was correctly 
indicated to be a final rejection both on the PTOL-326 and in the Office Action at pg 14, 
in accordance with MPEP § 706.07(b). Applicants 1 request for continued examination 
(RCE) under 37 CFR 1.114 was entered at the beginning of the 9/6/06 Office Action 
(see page 2). However, the conclusion of the Office Action (page 14) stated, "All claims 
are drawn to the same invention claimed in the application prior to the entry of the 
submission under 37 CFR 1.1 14 and could have been finally rejected on the grounds 
and art of record in the next Office action if they had been entered in the application 
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prior to entry under 37 CFR 1.114. Accordingly, THIS ACTION IS MADE FINAL even 
though it is a first action after the filing of a request for continued examination and the 
submission under 37 CFR 1.114. See MPEP § 706.07(b). Applicants are reminded of 
the extension of time policy as set forth in 37 CFR 1.136(a)." 

However, in view of the new grounds of rejection set forth below, the finality of 
the previous Office Action is withdrawn. 

Maintained Objections and/or Rejections 
Claim Rejections - 35 USC § 112, 2nd paragraph 

Claims 93-117 are rejected under 35 U.S.C. § 112, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. This rejection was set forth for claims 68-92 at pg 
11-12 of the 9/6/06 Office Action; these claims have been cancelled but the rejection is 
herewith applied to claims 93-117. 

Claims 93-97 are indefinite because the metes and bounds of the phrase 
"contained in" are unclear. For example, in claim 93 it is unclear whether a "polypeptide 
contained in SEQ ID NO: 2" is limited to a polypeptide consisting of SEQ ID NO: 2, or 
whether it encompasses shorter polypeptides (fragments) that are "contained in" the 
longer sequence of SEQ ID NO: 2. For purposes of prosecution, claims 93-97 have 
been interpreted broadly to include polynucleotides comprising fragments of SEQ ID 
NO: 1, or polynucleotides comprising fragments of polynucleotides encoding SEQ ID 
NO: 2. 

The remaining claims are rejected for depending from an indefinite claim. 

Applicants' 12/6/06 response at page 9 states "[previous claims 68-92 were also 
rejected under 35 USC 112 second paragraph. This rejection is not applicable to the 
current claims which do not recite "contained in"..." 

Applicants' arguments have been fully considered but are not found persuasive. 
Contrary to Applicants' statement at pg 9 of the response, new claims 93-97 each recite 
the phrase "contained in" one or more times. Specifically, it is noted that claim 93 uses 
the phrase "contained in" in five instances (in lines 3, 4, 5, 7, 10); claim 94 uses the 
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same phrase in two instances (in lines 2 and 4); claim 95 uses the same phrase in two 
instances (in lines 2 and 4); and claims 96 and 97 use the same phrase in one instance 
(each in line 2). It is noted that this rejection would be withdrawn if the claims were 
amended, for example, to recite "of in place of each usage of "contained in" (for . 
example, claim 96 would recite "the sequence of SEQ ID NO: 1"). 

New objections and/or rejections 
Claim Objections 

Claim 93 is objected to because of the following informalities: . 

(1) In claim 93, line 3, the word "polypeptide" is misspelled as "polypetide". 

Appropriate correction is required. 

Claim Rejections - 35 USC § 101 

35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

Claims 106-108 are rejected under 35 U.S.C. 101 because the claimed invention 
is directed to non-statutory subject matter. 

Claim 106, as written, does not sufficiently distinguish over cells that exist 
naturally because the claims do not particularly point out any non-naturally occurring 
differences between the claimed products and the naturally occurring products. 
Specifically, the claim encompasses non-isolated host cells found within an organism 
that contain the nucleic acid sequences encompassed by the claim. In the absence of 
the hand of man, the naturally occurring products are considered non-statutory subject 
matter. See Diamond v. Chakrabarty, 447 U.S. 303, 206 USPQ 193 (1980). Claim 106 
should be amended to indicate the hand of the inventor, e.g. by amendment to recite 
"An isolated cell..." See MPEP 2105. 

Claims 107 and 108 depend from claim 106 and are included in the rejection 
because they each encompass non-isolated host cells. 
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Claim Rejections - 35 USC § 112, 1st paragraph, new matter 

Claims 93-117 are also rejected under 35 U.S.C. 112, first paragraph, as failing 
to comply with the written description requirement because the claims contain new 
matter. 

New claim 93 recites, "...wherein high stringency conditions are incubating for 5 
minutes in 0.02X SSC at 65 degrees C..." However, the specification as originally filed 
does contain support for this limitation. The specification at page 16, paragraph [0044] 
teaches, "An example of highly stringent wash conditions is 15 minutes in 0.1 X SSC at 
65°C. An example of stringent was conditions is 15 minutes in 0.2 X SSC buffer at 
65°C." There is no conception in the specification of the limitation of "0.02X SSC" as an 
example of high stringency condition, nor does the concept of the specific limitation flow 
naturally from the disclosure of the specification. Therefore, the specification as 
originally filed lacks support for this limitation. Claims 94-1 17 are included in this 
rejection because they depend from claim 93. 

Claim Rejections - 35 USC § 102a 

Claims 93-98 are rejected under 35 U.S.C. 102(a) as being anticipated by NCBI 
Entrez Nucleotide, database record for Locus AC092214, Version AC092214.2, 15 
September 2001 , 60 pages. The earliest date to which the instant application claims 
priority is July 29 th , 2002. 

The database record for Locus AC092214, version AC092214.2, teaches a DNA 
sequence of 209,501 base pairs designated "Homo sapiens chromosome 7 clone 
RP1 1-81 1 J9, working draft sequence, 3 unordered pieces". An alignment between this 
DNA sequence and SEQ ID NO: 1 of the instant application indicates that residues 
158107-157151 of AC092214.2 are 99.9% similar to instant SEQ ID NO: 1 (an 
alignment of these sequences is attached to this Office Action as Sequence Alignment 
#1). The only difference between the two sequences is that residue 930 of SEQ ID NO: 
1, is listed as 'Y\ which indicates a pyrimidine residue (either a thymine (T) or cytosine 
('C')), and that residue 157236 of AC092214.2 is listed as a thymine reside (T). 
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Because a pyrimidine residue indicates a genus that includes either thymine or 
cytosine, the sequence of AC092214.2 effectively teaches two sequences, one of which 
comprises a species encompassed by SEQ ID NO: 1. Furthermore, an alignment 
between the protein encoded by AC092214.2 and SEQ ID NO: 2 (the polypeptide 
encoded by SEQ ID NO: 1) indicates that residues 158107-157151 of AC092214.2 
encode a polypeptide that is 100% identical to instant SEQ ID NO: 2 (an alignment of 
these sequences is attached to this Office Action as Sequence Alignment #2). The 
record for AC092214.2 indicates that this sequence is "Homo sapiens chromosome 7 
clone RP1 1 -81 1 J9" which meets the limitation of "isolated". Therefore, because 
AC092214.2 encodes a polypeptide that is 100% identical to instant SEQ ID NO: 2, this 
isolated DNA sequence anticipates each of instant claims 93-95 and 97. Furthermore, 
because AC092214.2 comprises one of the two sequences "contained in" SEQ ID NO: 
1, AC092214.2 also anticipates claim 96. Finally, the record for AC092214.2 indicates 
the sequence is from "genomic DNA" (see page 2) and therefore AC092214.2 also 
anticipates instant claim 98, which depends from claim 93 and encompasses a 
"genomic sequence". 

Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

Claims 93, 96, 97 and 98 are rejected under 35 U.S.C. 102(b) as being 
anticipated by NCBI Entrez Nucleotide, database record for Locus AC092214, Version 
AC092214J_, 29 June 2001 , 76 pages. The earliest date to which the instant application 
claims priority is July 29 th , 2002. 

This rejection is based on claims 93 and 96-98 encompassing polynucleotides 
comprising fragments of SEQ ID NO: 1 , or comprising fragments of polynucleotides 
encoding SEQ ID NO: 2. See "Claim Rejections - 35 U.S.C. 1 12, 2nd Paragraph. 
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The database record for Locus AC092214, version AC092214J., teaches a DNA 
sequence of 266224 base pairs designated "Homo sapiens chromosome UNK clone 
RP11-811J9, working draft sequence, 6 unordered pieces". An alignment between this 
DNA sequence and instant SEQ ID NO: 1 indicates an overall 83.9% similarity (an 
alignment of these sequences is attached to this Office Action as Sequence Alignment 
#3). Specifically, residues 7960-8802 of AC092214.1 are 97.6% similar to residues 117- 
957 of instant SEQ ID NO: 1, for an overall 83.9% similarity. Furthermore, an alignment 
between the protein encoded by AC092214.2 and SEQ ID NO: 2 (the polypeptide 
encoded by SEQ ID NO: 1) indicates that the encoded proteins share an overall 83.8% 
similarity (an alignment of these sequences is attached to this Office Action as 
Sequence Alignment #4). Specifically, residues 7941-8799 of AC092214.1 encode a 
protein that is 96.5% identical to residues 34-318 of SEQ ID NO: 2, for an overall 83.8% 
similarity between the two proteins. Furthermore, within the region of identity between 
the polynucleotide sequence of AC092214.1 and SEQ ID NO: 1 are numerous 
sequence of two or more polynucleotides that are 100% identical. Therefore, the 
polynucleotide sequence of AC092214.1 "has the sequence contained in SEQ ID NO: 
1" and "encodes a polypeptide comprising the sequence contained in SEQ ID NO: 2", 
and therefore anticipates claims 93, 96 and 97. Finally, the record for AC092214.1 
indicates the sequence is from "genomic DNA" (see page 2) and therefore AC092214.1 
also anticipates instant claim 98, which depends from claim 93 and encompasses a 
"genomic sequence". 
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Conclusion 



No claims are allowable. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Zachary C. Howard whose telephone number is 571- 
272-2877. The examiner can normally be reached on M-F 9:30 AM - 6:00 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Gary B. Nickol can be reached on 571-272-0835. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



zch 




GARY B. NICKOL, PH.D. 
SUPERVISORY PATENT EXAMINER 
TECHNOLOGY CENTER 1600 



10628464-l_vs_AC092214 . 2 

GenCore version 6.2 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: February 2, 2007, 15:17:51 ; Search time 21 Seconds 

(without alignments) 
19.095 Million cell updates/sec 

Title: US-10-628-464-1 
Perfect score: 956.6 

Sequence: 1 atgaatggagaccacatggt caaggtgtgggacaccttga 957 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 0.? 

Searched: 1 seqs, 209501 residues 

Total number of hits satisfying chosen parameters: 2 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : acv2.txt:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Matcn Length DB ID Description 



c 1 956.6 100.0 209501 1 AC092214 ACCESSION :AC092214 

2 26.4 2.8 209501 1 AC092214 ACCESSION :AC092214 



ALIGNMENTS 



RESULT 1 

AC092214/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AC092214 209501 bp DNA linear HTG 15-SEP-2001 

Homo sapiens chromosome 7 clone RP11-81U9, WORKING DRAFT SEQUENCE, 
3 unordered pieces. 
AC092214 

AC092214.2 Gl: 15625026 

HTG; HTGS_PHASE1; HTG S_D RAFT; HTGS_ACTIVEFIN . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; craniata; vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 209501) 
Waterston,R.H. 

The sequence of Homo. sapiens clone 
unpublished 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



10628464-l_vs_AC092214 . 2 
2 (bases 1 to 209501) 
waterston,R.H. 
Direct Submission 

Submitted (29-JUN-2001) Genome Sequencing Center, Washington 
University School of Medicine, 4444 Forest Park Parkway, St. Louis 
MO 63108, USA 

[WARNING] On Nov 15, 2001 this sequence was replaced by 
gi: 16931065. 

On Sep 15, 2001 this sequence version replaced gi:14575807. 

Genome Center 

Center: Washington university Genome Sequencing Center 
Center code: WUGSC 

web site:http://genome.wustl .edu/gsc/index. shtml 

Project Information 

Center project name: h„nh081U09 



Summary Statistics 

Sequencing vector: M13; 1% 
Sequencing vector: plasmid; 99% 
Chemistry: Dye-primer ET; 0% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads 
Assembly program: Phrap; version 0.990319 
Consensus quality: 208290 bases at least Q40 
Consensus quality: 208711 bases at least Q30 
Consensus quality: 209110 bases at least Q20 



NOTE: This is a 'working draft 1 sequence 
consists of 3 contigs. The true order of 
is not known and their order in this sequ 
arbitrary. Gaps between the contigs are r 
runs of N, but the exact sizes of the gap 
This record will be updated with the fmi 
as soon as it is available and the access 
be preserved 



FEATURES 

source 



misc_feature 
gap 

misc_feature 
gap 

misc_feature 



1 4259: contig of 4259 bp in 1 
4260 4359: gap of unknown length 
4360 9850: contig of 5491 bp in 1 
9851 9950: gap of unknown length 
9951 209501: contig of 199551 bp in 
Location/Qual i f i ers 
1. .209501 

/organi sm="Homo sapiens" 
/mo l_type="genomic dna" 
/db_xref="taxon:9606" 
/chromosome="7" 
/clone= M RPll-81l:i9" 
1. .4259 

/note= M assembl y_name : Conti gll" 
4260.. 4359 

/esti mated_l ength=unknown 
4360.. 9850 

/note= M assembly_name : Conti gl2" 
9851. .9950 

/estimated_length=unknown 
9951. .209501 

/note= M assembl y_name : Conti gl3" 



It currently 
the pieces 
ence record is 
epresented as 
s are unknown, 
shed sequence 
ion number will 

ength 

ength 

length. 



Query Match 100.0%; Score 956.6; 

Best Local similarity 99.9%; Pred. No. 0; 
Matches 956 ; Conservative 1; Mi smatches 



DB 1; Length 209501; 
0; Indels 0; Gaps 



Qy 



1 ATGAATGGAGACCACATGGTTCTAGGATCTTCGGTGACTGACAAGAAGGCCATCATCTTG 60 
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I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ATGAATGGAGACCACATGGTTCTAGGATCTTCGGTGACTGACAAGAAGGCCATCATCTTG 



Db 


15olU/ 


a Tr a ATrr a /~ a /~/~ a /~ a tc~ t — i — vt — r s f~ (~ \ ~vt — i — v r~ r r~r r a t — re a f a a /* a a (-rr-r a t/~ a ~t/ — i — rr~ 
ATGAA 1 GGAGALLALA I GG1 i LTAGGA 1 L 1 1 LGG 1 GAL 1 GALAAGAAGGCCATCATCTTG 


1 con/i o 
15oU4o 


Qy 


61 


GTTACCATTTTACI CC 1 1 1 1 ACGCCTGGTAGCAATAGCAGGCAATGGCTTCATCACTGCT 


120 




1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 II 1 1 1 




Db 


158U47 


GTTACCA 1 1 1 1 AC- 1 LL 1 1 1 1 ACGCCTGGTAGCAATAGCAGGCAATGGCTTCATCACTGCT 


15/yoo 


Qy 


121 


GCTCTGGGCGTGGAGTGGGTGCTACGGAGAATGTTGTTGCCTTGTGATAAGTTATTGGTT 


180 




1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 C7007 

i5/yo7 


/•rTrTrrrrrTrr a TTrrrTrrT a rrr a r a at^"t — rrnrrn — rrTr ata a /~t — tat — rrn — r 

GCTLTGGGCGTGGAGTGGGTGCTALGGAGAATGTTGTTGLLTTGTGATAAGTTATTGGTT 


15 /yzo 


Qy 


181 


AGCCTAGGGGCCTCTCGCTTCTGTCTGCAGTCAGTGGTAATGGGTAAGACCATTTATGTT 


240 




1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




AGLL 1 AGGGGLLTLTLGLTTLTGTLTGLAG 1 LAG 1 GG rAATGGGTAAGACCATTTATGTT 


15 /obo 


Qy 


241 


TTCTTGCATGCGATGGCCTTCCCATACAACCCTGTACTGCAGTTTCTAGC 1 1 I CCAGTGG 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 




Db 


1 C 7QC7 

15/ob/ 


1 1 L 1 1 GCATCCGATGGCCTTCCCATACAACCCTGTACTGCAG I ! I CTAGCTTTCCAGTGG 


1 C 7QAQ 

15/oUo 


Qy 


301 


GACTTCCTGAATGCTGCCACCTTATGGTCCTCTACCTGGCTCAGTGTCTTCTATTGTGTG 


360 




1 1 1 1 1 1 I 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 ! 1 II 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 C 70A7 

157807 


GACTTCCTGAATGCTGCCACCTTATGGTCCTCTACCTGGCTCAGT 


157748 


Qy 


361 


AAAATTGCTACCTTCACCCACCCTGTCTTCTTCTGGCTAAAGCACAAGTTGTCTGGGTGG 


420 




1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 < II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




r-v U 

Db 


1 C77/I7 

157747 


AAAATTGCTACCTTCACCCACCCTGTCTTCTTCTGGCTAAAGCACAAGTTGTCTGGGTGG 


157688 


Qy 


421 


CTACCATGGATGCTCTTCAGCTCTGTAGGGCTCTCCAGCTTCACCACCATTCTATTTTTC 


480 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 




Db 


"1 C 7CO 7 

Id/do/ 


CTACCATGGATGCTCTTCAGCTCTGTAGGGCTCTCCAGCTTCACCA 1 I 1 1 1 C 


1 C 70 Q 

15 /bzfo 


Qy 


481 


ATAGGCAACCACAGAATGTATCAGAACTA 1 1 1 AAGGAACCATCTACAACCTTGGAATGTC 


540 




1 1 II 1 1! 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


1 C 7CT 7 

15/bz7 


A A ^™ A A A y~ A ^* A A T/*T A f A A A A III A A A A /"* A ^P/"™T A A A /"* ^"11 A A "T" T /"" 

ATAGGCAACCACAGAATGTATCAGAACTA I 1 I AAGGAACCATCTACAACCTTGGAATGTC 


15/5bo 


Qy 


541 


ACTGGCGATAGCATACGGAGCTACTGTGAGAAATTCTATCTCTTCCCTCTAAAAATGATT 


600 




1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 C 7CC7 

lb/ bb/ 


A /~T/~ /~ /~ /~ A ~I~ A /~/~ A ~T" A /~/~/~ a ^~^~T A /~ ~T~/~ "I - /*" A /~ A A A I | ^-~T~ A T/*Tn 1 <T/*TrT A A A A A ~T~ f~ A 1 1 

ACTGGCGATAGCATACGGAGCTACTGTGAGAAATTCTATCTCTTCCCTCTAAAAATGATT 


15/ 5Uo 


Qy 


601 


ACTTGGACAATGCCCACTGCTGTC 1 III 1 CATTTGCATGATTTTGCTCATCACATCTCTG 


660 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


1 C 7 C A 7 

157507 


ACTTGGACAATGCCCACTGCTGTC 1 1 1 1 1 CATTTGCATGA I 1 1 1 GCTCATCACATCTCTG 


15744o 


Qy 


661 


GGAAGACACAGGAAGAAGGCTCTCCTTACAACCTCAGGATTCCGAGAGCCCAGTGTGCAG 


720 




1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 




Db 


1 C 7/1 >1 7 

15/447 


GGAAGACACAGGAAGAAGGCTCTCCTTACAACCTCAGGATTCCGAGAGCCCAGTGTGCAG 


15/ ioo 


Qy 


721 


GCACACATAAAGGCTCTGCTGGCTCTCC I C 1 C 1 I I 1 GCCATGCTCTTCATCTCATATTTC 


780 




II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 LI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




UD 


1j/jo/ 


GCACACATAAAGGCTCTGCTGGCTCTCCTL 1 L 1 1 I 1 GCCATGC T CTTCATC 1 LA 1 A 1 1 1 L 




Qy 


781 


CTGTCACTGGTGTTCAGTGCTGCAGGTA Mill CCACCTCTGGACTTTAAATTCTGGGTG 


840 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 




Db 


157327 


CTGTCACTGGTGTTCAGTGCTGCAGGTA 1 1 1 1 1 CCACCTCTGGACTTTAAATTCTGGGTG 


157268 


Qy 


841 


TGGGAGTCAGTGATTTATCTGTGTGCAGCAGTTCACCCCATCATTCTGCTCTTCAGCAAC 


900 


157267 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M M 1 1 1 1 1 1 




Db 


TGGGAGTCAGTGATTTATCTGTGTGCAGCAGTTCACCCCATCATTCTGCTCTTCAGCAAC 


157208 


Qy 


901 


TGCAGGCTGAGAGCTGTGCTGAAGAGTCGYCGTTCCTCAAGGTGTGGGACACCTTGA 957 


Db 


157207 


III MM III III II II III III III llhll III Mil III Mill III II Mill 




TGCAGGCTGAGAGCTGTGCTGAAGAGTCGTCGTTCCTCAAGGTGTGGGACACCTTGA 157151 
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OM protein 
Run on: 



ac092214.2 

GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 

nucleic search, using f rame_plus_p2n model 

January 25, 2007, 15:53:17 ; Search time 61 Seconds 

(without alignments) 
3.276 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



US-10-628-464-2 
1678 

1 MNGDHMVLGSSVTDKKAIIL . 



, SNCRLRAVLKSRRSSRCGTP 318 



BLOSUM62 

xgapop 10.0 
Ygapop 10.0 
Fgapop 6.0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



1 seqs, 209501 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: 



Minimum Match 0% 

Maximum Match 100% 

Listing first 500 summaries 



Command line parameters: 

-MODEL=frame+_p2n. model -DEV=soft -Q=US10628464 . pep -DB=acv2.txt -SUFFlx=pto 
-OUT=ac092214.2.res -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 -UNITS=bitS -START=1 
-END=-1 -MATRlX=blosum62 -TRANS=human40 . cdi -LIST=500 -DOCALIGN=200 
-THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=50 -MODE=LOCAL -OUTFMT=ptO 
-NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -NCPU=6 -NO_XLPXY 
-NEG_SCORES=0 -LONGLOG -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7 
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database 



acv2.txt:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. Score Match Length DB ID 



SUMMARIES 



Description 



1 1678 100.0 209501 1 AC092214 

2 80 4.8 209501 1 AC092214 



ACCESSION:AC092214 
ACCESSION:AC092214 



ALIGNMENTS 

RESULT 1 
AC092214/C 

LOCUS AC092214 209501 bp DNA linear HTG 15-SEP-2001 

DEFINITION Homo sapiens chromosome 7 clone RP11-81U9, WORKING DRAFT SEQUENCE, 
3 unordered pieces. 
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ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



ac092214.2 

AC092214 

AC092214.2 Gl:15625026 

HTG ; HTGS_PHASEl; HTGS_DRAFT; HTGS_ACTIVEFIN . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 209501) 
waterston,R.H. 

The sequence of Homo sapiens clone 
Unpublished 

2 (bases 1 to 209501) 
waterston,R.H. 
Direct Submission 

Submitted (29-JUN-2001) Genome Sequencing Center, Washington 
University school of Medicine, 4444 Forest Park Parkway, St. Louis, 
MO 63108, USA 

[WARNING] On Nov 15, 2001 this sequence was replaced by 
gi: 16931065. 

On Sep 15, 2001 this sequence version replaced gi: 14575807. 



Genome Center 

Center: Washington university Genome Sequencing Center 
Center code: WUGSC 

web si te : http : //genome . wustl . edu/gsc/i ndex . shtml 

Project information 

Center project name: H_NH081U09 



Summary Statistics 

Sequencing vector: M13; 1% 
Sequencing vector: plasmid; 99% 
Chemistry: Dye-primer ET; 0% of reads 
Chemistry: Dye-terminator Big Dye; 100% of reads 
Assembly program: Phrap; version 0.990319 
Consensus quality: 208290 bases at least Q40 
Consensus quality: 208711 bases at least Q30 
Consensus quality: 209110 bases at least Q20 



FEATURES 

source 



misc_feature 
gap 

misc_feature 



NOTE: This is a 'working draft 1 sequence. It currently 
consists of 3 contigs. The true order of the pieces 
is not known and their order in this sequence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished sequence 
as soon as it is available and the accession number will 
be preserved. 

1 4259: contig of 4259 bp in length 
4260 4359: gap of unknown length 
4360 9850: contig of 5491 bp in length 
9851 9950: gap of unknown length 
9951 209501: contig of 199551 bp in length. 
Locati on/Qual i f i e rs 
1. .209501 

/organism= n Homo sapiens" 
/moi_type="genomic DNA" 
/db_xref="taxon:9606" 
/chromosome= ,, 7 M 
/clone= ,, RPll-81lJ9" 
1. .4259 

/note= M assembl y_name : Conti gll" 
4260.. 43 59 

/esti mated_l ength=unknown 
4360.. 9850 
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gap 

misc_feature 



ac092214.2 
/note="assembly_name:Contigl2" 
9851.. 9950 

/esti mated_l ength=unknown 
9951. .209501 

/note="assembl y_name : Conti gl3 " 



Alignment Scores: 
Pred. No. : 
Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
db: 



0 

1678.00 

100.0% 

100.0% 

100.0% 

1 



Length: 
Matches: 
Conservative: 
Mismatches: 
indels : 
Gaps : 



209501 

318 

0 

0 

0 

0 



US-10-628-464-2 (1-318) x AC092214 (1-209501) 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



1 MetAsnGlyAspHisMetvalLeuGlySerServalThrAspLysLysAlaileileLeu 20 
1 1 1 I 1 1 1 I 1 1 I 1 1 1 I 1 1 II 1 1 1 I I II I 1 1 1 1 1 I I 1 1 I 1 1 I 1 1 1 1 I 1 1 I 1 1 1 1 1 1 1 M I II 

15 8 107 ATGAATGGAGACCACATGGTTCTAGGATCTTCGGTGACTGACAAGAAGGCCATCATCTTG 158048 

21 valThrileLeuLeuLeuLeuArgLeuvalAlaileAlaGlyAsnGlyPheileThrAla 40 
MINI M III I II II II II I! IIII III! MM I II ! M I Ml I I NIMH IIIMM 

158047 GTTACCATTTTACTCCTTTTACGCCTGGTAGCAATAGCAGGCAATGGCTTCATCACTGCT 157988 

41 AlaLeuGlyValGluTrpValLeuArgArgMetLeuLeuProCysAspLysLeuLeuVal 60 
IIII MM II MM I II lllllll Mil Mill III Ml I II III II M II MM I III I 

157987 GCTCTGGGCGTGGAGTGGGTGCTACGGAGAATGTTGTTGCCTTGTGATAAGTTATTGGTT 157928 

61 SerLeuGlyAlaSerArgPheCysLeuGlnServalvalMetGlyLysThrlleTyrval 80 
MMMMIMMMIMIMI MMMMMMIIMM MIMIII IMMMMIM 

157927 AGCCTAGGGGCCTCTCGCTTCTGTCTGCAGTCAGTGGTAATGGGTAAGACCATTTATGTT 157868 



81 



100 



PheLeuHi sProMetAl aPheProTy rAsnProVal LeuGl nPheLeuAl aPheGl nTrp 
MIMIIII III II IIII MMMMMMIIMM II lllllll II II III II II MM 

157867 TTCTTGCATCCGATGGCCTTCCCATACAACCCTGTACTGCAGTTTCTAGCTTTCCAGTGG 157808 

101 AspPheLeuAsnAlaAlaThrLeuTrpSerSerThrTrpLeuSerVal PheTyrCysVal 120 
I I 1 1 1 I 1 1 1 1 1 1 1 J 1 1 I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 I 1 1 I I 1 1 I i 1 1 1 1 I I 1 1 I I 1 1 I 1 1 1 

157807 GACTTCCTGAATGCTGCCACCTTATGGTCCTCTACCTGGCTCAGTGTCTTCTATTGTGTG 157748 

121 LyslleAlaThrPheThrHisProvalPhePheTrpLeuLysHi sLysLeuSerGlyTrp 140 
II I I 1 1 II I II II II II I I M I 1 1 II 1 1 I II I M I II I II II II M II II I I M M II I I 

157747 AAAATTGCTACCTTCACCCACCCTGTCTTCTTCTGGCTAAAGCACAAGTTGTCTGGGTGG 157688 

141 LeuProTrpMetLeuPheSerServalGlyLeuSerSerPheThrThrileLeuPhePhe 160 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

157687 CTACCATGGATGCTCTTCAGCTCTGTAGGGCTCTCCAGCTTCACCACCATTCTATTTTTC 157628 



161 



180 



II eGl yAsnHi sArgMetTyrGl nAsnTy r LeuArgAsnHi sLeuGl nProTrpAsnVal 

lit 1 1 1 T 1 1 1 1 1 1 1 1 1 1 1 1 r i i 1 1 1 1 1 T 1 1 1 1 1 1 1 1 1 [ i e 1 1 1 1 1 1 1 

157627 ATAGGCAACCACAGAATGTATCAGAACTATTTAAGGAACCATCTACAACCTTGGAATGTC 157568 

181 ThrGlyAspSerileArgSerTyrCysGluLysPheTyrLeuPheProLeuLysMetlle 200 
MMM IIII MIMIMIM lllllllllllllllll MMIMIMM III lllllll 
157567 ACTGGCGATAGCATACGGAGCTACTGTGAGAAATTCTATCTCTTCCCTCTAAAAATGATT 157508 

201 ThrTrpThrMetProThrAlaValPhePheileCysMetlleLeuLeuileThrSerLeu 220 
III MIIIMI II II III I MM IIMJI II II I III MM III III I III II II II II I 

157507 ACTTGGACAATGCCCACTGCTGTC I I I I I CATTTGCATGATTTTGCTCATCACATCTCTG 157448 

221 GlyArgHisArgLysLysAlaLeuLeuThrThrSerGlyPheArgGluProServalGln 240 
MMMIMMllIMM IIIIIIIIIIIIMMIII MMMMMMIIMM IIII I 

157447 GGAAGACACAGGAAGAAGGCTCTCCTTACAACCTCAGGATTCCGAGAGCCCAGTGTGCAG 157388 
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ac092214.2 

Qy 241 AlaHislleLysAlaLeuLeuAlaLeuLeuSerPheAlaMetLeuPhelleSerTyrPhe 260 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I II 11 I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 
Db 157387 GCACACATAAAGGCTCTGCTGGCTCTCCTCTCTTTTGCCATGCTCTTCATCTCATATTTC 157328 

Qy 261 LeuSerLeuvalPheSerAlaAlaGlyllePheProProLeuAspPheLysPheTrpVal 280 

1 I I I I I I 1 I 1 I I I 1 I I I 1 I I I I 1 1 I I I I I I i I i I I I I 1 I I I I I I i I 1 1 I I I I I I I 1 I 1 I 1 
Db 157327 CTGTCACTGGTGTTCAGTGCTGCAGGTA I I I I I CCACCTCTGGACTTTAAATTCTGGGTG 157268 

Qy 281 TrpGluSerVallleTyrLeuCysAlaAlaValHisProlleileLeuLeuPheSerAsn 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 157267 TGGGAGTCAGTGATTTATCTGTGTGCAGCAGTTCACCCCATCATTCTGCTCTTCAGCAAC 157208 

Qy 301 CysArgLeuArgAlaVal LeuLysSerArgArgSerSerArgCysGlyThrPro 318 

IIIIITIIIMTIIIMIIIMIMIIIITIITIIIIIIIITIIIIIIIIIIII 

Db 157207 TGCAGGCTGAGAGCTGTGCTGAAGAGTCGTCGTTCCTCAAGGTGTGGGACACCT 157154 



R&SULT 2 
AC0$2214 
LOCI 
DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



AC092214 209501 bp DNA linear HTG LS^SEP-2001 

Homo sapiens chromosome 7 clone RP11-81U9, WORKING DRyyf SEQUENCE, 
3 unordered pieces. 
AC092214 

AC092214.2 Gl:15625026 

;ITG; HTGS_PHASE1; HTG S_D RAFT; HTGS_ACTIVEFIN . 

mo sapiens (human) 
Holtjo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Ver^ebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini jyHominidae; Homo. 

1 (b^ses 1 to 209501) 
water stCvn,R.H. 

The sequence of Homo sapiens clone 
Unpublishe ' 

2 (bases l\o 209501) 
Waterston,R. 
Direct submission 
Submitted (29^1/^2001) Gen 
University School 
MO 63108, USA 
[WARNING] On NOV 15, 
gi : 16931065 

On Sep 15, 2001 thi s/^e^uence version replaced gi:14575807 



ffe Sequencing Center, Washington 
f Medicjfie, 4444 Forest Park Parkway, St. Louis, 

OjJl this sequence was replaced by 



^ Genome^enter 
Center: Washington Universi ty^Genome Sequencing Center 
Center code: wpGSC 

web site:http<//genome.wustl .edVgsc/index.shtml 

/- — Project Information 
Center project name: H_NH081U09 



Summary statistics - 

Seque/fcing vector: Ml3; 1% 
Sem/encing vector: plasmid; 99% 
Cl>emistry: Dye-primer ET; 0% of reads 

lemistry: Dye-terminator Big Dye; 100% of r^ds 
'Assembly program: Phrap; version 0.990319 
Consensus quality: 208290 bases at least Q40 
Consensus quality: 208711 bases at least Q30 
Consensus quality: 209110 bases at least Q20 

* NOTE: This is a f working draft 1 sequence. It currentNy 

* consists of 3 contigs. The true order of the pieces 

* is not known and their order in this sequence record is 
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10628464-l_vs_AC092214 . 1 

GenCore version 6.2 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: February 2, 2007, 15:16:21 ; Search time 26 Seconds 

(without alignments) 
19.598 Million cell updates/sec 

Title: US-10-628-464-1 
Perfect score: 956.6 

Sequence: 1 atgaatggagaccacatggt caaggtgtgggacaccttga 957 

Scoring table: identity_nuc 

Gapop 10.0 , Gapext 0.5 

Searched: 1 seqs, 266224 residues 

Total number of hits satisfying chosen parameters: 2 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : acvl.txt:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 802.4 83.9 266224 1 AC092214 ' ACCESSION :AC092214 

c 2 252.8 26.4 266224 1 AC092214 ACCESSION :AC092214 



ALIGNMENTS 



RESULT 1 
AC092214 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



AC092214 266224 bp DNA linear HTG 29-JUN-2001 

Homo sapiens chromosome UNK clone RP11-811J9, working draft 
sequence, 6 unordered pieces. 
AC092214 

AC092214.1 Gl:14575807 

HTG; HTGS_PHASEl; HTG S_D RAFT; HTGS_FULLTOP . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; catarrhini; Hominidae; Homo. 
1 (bases 1 to 266224) 
Waterston,R.H. 

The sequence of Homo sapiens clone 
Unpublished 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



10628464-l_vs_AC092214 . 1 
2 (bases 1 to 266224) 
waterston ,R.H. 
Direct Submission 

Submitted (29-JUN-2001) Genome Sequencing Center, Washington 
university School of Medicine, 4444 Forest Park Parkway, St. Louis, 
MO 63108, USA 

[WARNING] On Sep 15, 2001 this sequence was replaced by 
gi : 15625026. 

Genome Center 

Center: Washington University Genome Sequencing Center 
Center code: WUGSC 

Web site:http://genome.wustl . edu/gsc/index.shtml 

Project information 

Center project name: H_NH081U09 

Summary statistics 

Sequencing vector: M13; 1% 
Sequencing vector: plasmid; 98% 
Chemistry: Dye-primer ET; 0% of reads 
Chemistry: Dye-terminator Big Dye; 99% 
Assembly program: Phrap; version 0. 
Consensus quality: 203494 bases at 
Consensus quality: 204566 bases at 
Consensus quality: 264940 bases at 
Insert size: 210000; agarose-fp 
insert size: 265724; sum-of-contigs 
Quality coverage: 7.96 in Q20 bases; agarose-fp 
Quality coverage: 6.30 in Q20 bases; sum-of-contigs 



of reads 
.990319 
least Q40 
least Q30 
least Q20 



NOTE: This is a 'working draft' sequence. It currently 
consists of 6 contigs. The true order of the pieces 
is not known and their order in this sequence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished sequence 
as soon as it is available and the accession number will 
be preserved. 

of 1883 
unknown 
of 1081 
unknown 
of 4659 
unknown 
of 6448 
unknown 
of 51041 



1 

1884 
1984 
3065 
3165 
7824 
7924 
14372 
14472 
65513 
65613 



FEATURES 

source 



contig 
gap of 
conti 
gap 

conti q 
gap of 
conti 
gap 

contig 
gap of 
contig 



misc_feature 
gap 

misc_feature 
gap 



1883: 
1983: 
3064: 
3164: 
7823: 
7923: 
14371: 
14471: 
65512: 
65612: 
266224: 
Locati on/Qual i f i e r s 
1. .266224 

/organism="Homo sapiens" 
/mo l_type="genomic dna" 
/db_xref="taxon:9606" 
/chromosome="UNK" 
/clone^RPll-Slljg" 
1. .1883 

/note="assembl y_name : conti gll 
1884.. 1983 

/estimated_length=unknown 
1984.. 3064 

/note="assembl y_name : Conti gl2 
3065. .3164 
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bp in length 
length 

bp in length 
length 

bp in length 
length 

bp in length 
length 
bp in length 
unknown length 
of 200612 bp in length. 



misc_feature 
gap 

mi sc_feature 
gap 

misc_feature 
gap 

misc_feature 



10628464-l_vs_AC092214 . 1 
/estimated_length=unknown 
3165.. 7823 

/note="assembl y_name : Conti gl3 " 
7824.. 7923 

/estimated_Jength=unknown 
7924.. 14371 

/note="assembl y_name : Conti gl4" 
14372.. 14471 

/esti mated_l ength=unknown 
14472. .65512 

/note="assembl y_name : conti gl5" 
65513. .65612 

/esti mated_l ength=unknown 
65613.. 266224 

/note="assembl y_name : Conti gl6" 



Query Match 83.9%; 
Best Local similarity 97.6%; 
Matches 823; Conservative 



Score 802.4; DB 1; 
Pred. No. 0; 
1; Mismatches 17; 



Length 266224; 
indels 2; Gaps 



Qy 


117 


Db 


7960 


Qy 


175 


Db 


8020 


Qy 


235 


Db 


8080 


Qy 


295 


Db 


8140 


Qy 


355 


Db 


8200 


Qy 


415 


Db 


8260 


Qy 


475 


Db 


8320 


Qy 


535 


Db 


8380 


Qy 


595 


Db 


8440 


Qy 


655 


Db 


8500 


Qy 


715 


Db 


8560 



TGCTGCTCTGGGCGTGGAGTGGGTG— CTACGGAGAATGTTGTTGCCTTGTGATAAGTTA 

I Mill 1 1 1 1 MINI Mill I MMMMMIMI lllllllllll 

TTCTGCTGTGGGAGTGGAGGGGGTGGCTACGGAGGAATGTTGTTGCCTGGTGATAAGTTA 
TTGGTTAGCCTAGGGGCCTCTCGCTTCTGTCTGCAGTCAGTGGTAATGGGTAAGACCATT 

IMMIIII M I I I II I I II I M I I M M, MINIMUM ! MIMIIIMIII 

TTGGTTAGCTTAGGGGCCTCTCGCTTCTTTCTGCAGTCAGTGGTAAGGGGTAAGACCATT 
TATGTTTTCTTGCATCCGATGGCCTTCCCATACAACCCTGTACTGCAGTTTCTAGCTTTC 

IMIIMI I 1 1 II 1 1 1 1 1 1 1 1 1 I I II I M 1 1 1 1 1 1 I II 1 1 I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

TATGTTTTGTTGCATCCGATGGCCTTCCCATACAACCGTGTACTGCAGTTTCTAGCTTTC 
CAGTGGGACTTCCTGAATGCTGCCACCTTATGGTCCTCTACCTGGCTCAGTGTCTTCTAT 

I I I II I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I M I II II 

CAGTGGGACTTCCTGAATGCTGCCACCTTATGGTCCTCTACCTGGCTCAGTGTCTTCTAT 

TGTGTGAAAATTGCTACCTTCACCCACCCTGTCTTCTTCTGGCTAAAGCACAAGTTGTCT 

II I II I II I I I I I I I I I I I II I I II II I II I I I I I I I I I II I I I I I I I I II I I I I I I I I 
TGTGTGAAAATTGCTACCTTCACCCACCCTGTCTTCTTCTGGGTAAAGCACAAGTTGTCT 

GGGTGGCTACCATGGATGCTCTTCAGCTCTGTAGGGCTCTCCAGCTTCACCACCATTCTA 

Ml MIMMMIMI MM IMMMMIMI Ml MM M M MM I M M M M M 

GGGTGGCTACCATGGATGCTCTTCAGCTCTGTAGGGCTCTCCAGCTTCACCACCATTCTA 



I I I I I CATAGGCAACCACAGAATGTATCAGAACTATTTAAGGAACCATCTACAACCTTGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I II II 
I I I I I CATAGGCAACCACAGAATGTATCAGAACTATTTAAGGAACCATCTACAACCTTGG 

AATGTCACTGGCGATAGCATACGGAGCTACTGTGAGAAATTCTATCTCTTCCCTCTAAAA 

MMMMMMIMM MMMMMMMMMMMMMMMMIMIMM II I 

AATGTCACTGGCGATAGCATACGGAGCTACTGTGAGAAATTCTATCTCTTCCCTCTAAAA 
ATGATTACTTGGACAATGCCCACTGCTGTC I I I I I CATTTGCATGATTTTGCTCATCACA 

MM I III IMMIIII II II II II III 1 1 IMMIIII II 1 1 II II II II II II 1 1 II I 

ATGATTACTTGGACAATGCCCACTGCTGTC I I I I I CATTTGCATGATTTTGCTCATCACA 
TCTCTGGGAAGACACAGGAAGAAGGCTCTCCTTACAACCTCAGGATTCCGAGAGCCCAGT 

M I I I 1 1 M 1 1 i M M I I I I I I I i I I 1 1 II 1 1 1 I I 1 1 I I 1 1 I II I M I I I I I I M I I M I 

TCTCTGGGAAGACACAGGAAGAAGGCTCTCCTTACAACCTCAGGATTCCGAGAGCCCAGT 



GTGCAGGCACACATAAAGGCTCTGCTGGCTCTCCTCTCTTT 

I Mill III I III III Ml III I IMIMIII MM Ml I 



GCCATGCTCTTCATCTCA 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 ! I 

GTGCAGGCACACATAAAGGCTCTGCTGGCTCTCCTCTCTTTTGCCATGCTCTTCATCTCA 
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1; 
174 
8019 
234 
8079 
294 
8139 
354 
8199 
414 
8259 
474 
8319 
534 
8379 
594 
8439 
654 
8499 
714 
8559 
774 
8619 



10628464-l_vs_AC092214 . 1 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



775 TATTTCCTGTCACTGGTGTTCAGTGCTGCAGGTATTTT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
8620 TATTTCCTGTCACTGGTGTTCAGTGCTGCAGGTATTTT 



TCCACCTCTGGACTTTAAATTC 834 

I II III II II II! II II III II 

TCCACCTCTGGACTTTAAATTC 8679 



835 
8680 

895 
8740 



TGGGTGTGGGAGTCAGTGATTTATCTGTGTGCAGCAGTTCACCCCATCATTCTGCTCTTC 894 
I I I I I I I I I ! I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGGGTGTGGGAGTCAGTG ATTTATCTGTGTGCAGCAGTTCACCCCATCATTCTGCTCTTC 8739 

AGCAACTGCAGGCTGAGAGCTGTGCTGAAGAGTCGYCGTTCCTCAAGGTGTGGGACACCT 954 
I I I I I I I I I I I I I I I I I I I I I I I I I 111 Ml I I I hi I I I I I I I I I I I I I I I I II I I I I I 
AGCAACTGCAGGCTGAG AGCTGTGCTGAAGAGTCGTCGTTCCTCAAGGTGTGGGACACCT 8799 



955 TGA 957 
III 

8800 TGA 8802 



29-JUN-2001 
DRAFT 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



HTGS_FULLTOP . 



Craniata- 



/ertebrata ; Eutel eostomi ; 
Hominidae; Homo. 



AC092214 266224 bp DNA linear 

Homo sapiens chromosome UNK clone RP11-81U9, WORKI 
SEQUENCE, 6 unordered pieces. 
AC092214 

AC092214.1 Gl: 14575807 
jTG ; HTGS^PHASEl; HTGS_DRAFT; 

mo sapiens (human) 
Homo sapiens 

Eukanryota; Metazoa; chordata; 
Mammalia; Eutheria; Primates; Catarrhi 

1 (ba^es 1 to 266224) 
waterston,R.H. 

The segueKfe of Homo sapiens clo 
unpublishe ' 

2 (bases l\o 266224) 
waterston,R.H 
Direct Submission 

submitted (29-JU^2001) Genome Sequencing Center, Washington 
university School \>f MejKcine, 4444 Forest Park Parkway, St. Louis, 
MO 63108, USA 

[WARNING] On Sep 15,^001 this sequence was replaced by 
gi: 15625026 

Genonta Center 

Center: Washington university Genome Sequencing center 
Center code: jtfUGSC 

Web si te : htjp : //genome . wustl . ^ski/gsc/i ndex . shtml 

Project Information 
Center p/6ject name: H_NH081U09 

Summary statistics 
Sequencing vector: M13; 1% 
Sequencing vector: plasmid; 98% 
r 1emistry: Dye-primer ET; 0% of reads 
Chemistry: Dye-terminator Big Dye; 99% of r\ 
Assembly program: Phrap; version 0.990319 
Consensus quality: 203494 bases at least Q40 
Consensus quality: 204566 bases at least Q30 
Consensus quality: 264940 bases at least Q20 
Insert size: 210000; agarose-fp 
insert size: 265724; sum-of-contigs 
Quality coverage: 7.96 in Q20 bases; agarose-fp 
Quality coverage: 6.30 in Q20 bases; sum-of-contigs 
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ids 



ac092214.1 

GenCore version 5.1.9 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 

Run on: January 25, 2007, 15:37:56 ; Search time 78 Seconds 

(without alignments) 
3.256 Million cell updates/sec 

US-10-628-464-2 
1678 

1 MNGDHMVLGSSVTDKKAIIL SNCRLRAVLKSRRSSRCGTP 318 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



BLOSUM62 

xgapop 10.0 
Ygapop 10.0 
Fgapop 6.0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 1 seqs, 266224 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: 



Minimum Match 0% 

Maximum Match 100% 

Listing first 500 summaries 



Command line parameters: 

-MODEL=frame+_p2n. model -rDEV=sof t -Q=USl0628464.pep -DB=acvl.txt -SUFFlX=pto 
-OUT=ac092214.1.res -MINMATCH=0.1 -LOOPCL=0 -LOOPEXT=0 -UNITS=bitS -START=1 
-END=-1 -MATRlx=blosum62 -TRANS=human40 . cdi -LIST=500 -DOCALIGN=200 
-THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=50 -MODE=LOCAL -OUTFMT=ptO 
-NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 -NCPU=6 -NO_XLPXY 
-NEG_SCORES=0 -LONGLOG -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6 -FGAPEXT=7 
-YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 



Database 



acvl.txt:* 



Pred. no. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. Score Matcn Length DB 



SUMMARIES 



ID 



Description 



1 1407 83.8 266224 1 AC092214 

2 609 36.3 266224 1 AC092214 



ACCESSION :AC092214 
ACCESSION :AC092214 



ALIGNMENTS 

RESULT 1 
AC092214 

LOCUS AC092214 266224 bp DNA linear HTG 29-JUN-2001 

definition Homo sapiens chromosome UNK clone RP11-81U9, WORKING DRAFT 
SEQUENCE, 6 unordered pieces. 
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ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



ac092214.1 

AC092214 

AC092214.1 Gl: 14575807 

HTG; HTGS_PHASEl; HTGS_DRAFT; HTGS_FULLTOP . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 266224) 
waterston,R.H. 

The sequence of Homo sapiens clone 
Unpublished 

2 (bases 1 to 266224) 
waterston, R.H. 
Direct Submission 

Submitted (29-JUN-2001) Genome Sequencing Center, Washington 
University School of Medicine, 4444 Forest Park Parkway, St. Louis 
MO 63108, USA 

[WARNING] On Sep 15, 2001 this sequence was replaced by 
gi:15625026. 



Genome Center 

Center: Washington University Genome Sequencing Center 
Center code: WUGSC 

Web site:http://genome.wustl .edu/gsc/index. shtml 

Project information 

Center project name: H_NH081U09 



Summary Statistics 

Sequencing vector: Ml3; 1% 
Sequencing vector: plasmid; 98% 
Chemistry: Dye-primer ET; 0% of reads 
Chemistry: Dye-terminator Big Dye; 99% of reads 
Assembly program: Phrap; version 0.990319 
Consensus quality: 203494 bases at least Q40 
Consensus quality: 204566 bases at least Q30 
Consensus quality: 264940 bases at least Q20 
insert size: 210000; agarose-fp 
insert size: 265724; sum-of-contigs 
Quality coverage: 7.96 in Q20 bases; agarose-fp 
Quality coverage: 6.30 in Q20 bases; sum-of-contigs 



FEATURES 

source 



NOTE: This is a 'working draft' sequence. It currently 
consists of 6 contigs. The true order of the pieces 
is not known and their order in this sequence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished sequence 
as soon as it is avai lable and the accession number will 

contiq of 1883 bp in length 

gap of unknown length 

contiq of 1081 bp in length 

gap of unknown length 

contiq of 4659 bp in length 

gap of unknown length 

contiq of 6448 bp in length 

gap of unknown length 
contiq of 51041 bp in length 

gap of unknown length 
contig of 200612 bp in length. 
Locati on/Qual i f i ers 
1.. 266224 

/organism="Homo sapiens" 
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* 


be preserved. 


* 


1 


1883: 




1884 


1983: 


* 


1984 


3064: 


* 


3065 


3164: 


* 


3165 


7823: 


* 


7824 


7923: 




7924 


14371: 


* 


14372 


14471: 


* 


14472 


65512: 


* 


65513 


65612: 


* 


65613 


266224: 



misc_feature 
gap 

misc_feature 
gap 

misc_feature 
gap 

misc_feature 
gap 

misc_feature 
gap 

misc_feature 



Alignment Scores: 
Pred. No. : 
Score: 

Percent Similarity: 
Best Local Similari 
Query Match: 
DB: 



ac092214.1 
/mol_type="genomic DNA" 
/db_xref="taxon:9606" 
/chromosome="UNK" 
/clone="RPll-81U9" 
1. .1883 

/note="assembl y_name : Conti gll" 
1884.. 1983 

/estimated_length=unknown 
1984.. 3064 

/note="assembl y_name : Conti gl2 " 
3065.. 3164 

/estimated_l ength=unknown 
3165. .7823 

/note="assembl y_name : Conti gl3 ' 
7824.-7923 

/estimated_length=unknown 
7924.. 14371 

/note="assembl y_name : Conti gl4" 
14372.. 14471 

/estimated_l ength=unknown 
14472. .65512 

/note="assembly_name : Conti gl5' 
65513.. 65612 

/estimated_length=unknown 
65613.. 266224 

/note="assembl y_name : Conti gl6" 



ty: 



0 

1407.00 
96.5% 
.5% 
.8% 



95. 
83. 
1 



Length : 
Matches : 
Conservative : 
Mi smatches : 
indels : 
Gaps : 



266224 

276 

3 

6 

4 

0 



US-10- 


-628-464 


Qy 


34 


Db 


7941 


Qy 


52 


Db 


8001 


Qy 


72 


Db 


8061 


Qy 


92 


Db 


8121 


Qy 


112 


Db 


8181 


Qy 


132 


Db 


8241 


Qy 


152 



-318) x AC092214 (1-266224) 



GlyAsnGlyPhe-lleThr-AlaAlaLeuGlyValGluTrpVal-LeuArgArg-MetLe 52 
llllllllllll I I I I I I I |:::| I I I I I I I I III Mllli'llT Mill 

GGCAATGGCTTCAATCACTTTCTGCTGTGGGAGTGGAGGGGGTGGCTACGGAGGAATGTT 8000 

uLeuProCysAspLysLeuLeuValserLeuGl yAlaSerArgPheCysLeuGlnServa 72 

ill ill iiiiiiilliillillllllllllllllllllTlll in i i n 

GTTGCCTGGTGATAAGTTATTGGTTAGCTTAGGGGCCTCTCGCTTCTTTCTGCAGTCAGT 8060 

lValMetGlyLysThrileTyrvalPheLeuHisProMetAlaPheProTyrAsnProva 92 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 
GGTAAGGGGTAAGACCATTTATGTTTTGTTGCATCCGATGGCCTTCCCATACAACCGTGT 8120 

1 LeuGl nPheLeuAl aPheGl nTrpAspPheLeuAsnAl aAl aTh rLeuTrpse rSerTh 112 
I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I II I I I I I I I M I I I I 

ACTGCAGTTTCTAGCTTTCCAGTGGGACTTCCTG AATGCTGCCACCTTATGGTCCTCTAC 8180 

rTrpLeuServal PheTyrCysval LysileAlaThrPheThrHisProval PhePheTr 132 
I I M I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I M I I I I I I I I M 

CTGGCTCAGTGTCTTCTATTGTGTGAAAATTGCTACCTTCACCCACCCTGTCTTCTTCTG 8240 

pLeuLysHisLysLeuSerGlyTrpLeuProTrpMetLeuPheSerSerValGlyLeuSe 152 
I:::| I I I II I I I I I I I I I I I I I I I I I I II III II III II MM II III III II M II II 
GGTAAAGCACAAGTTGTCTGGGTGGCTACCATGG ATGCTCTTCAGCTCTGTAGGGCTCTC 8300 



rSerPheThrThrlleLeuPhePhelleGlyAsnHisArgMetTyrGlnAsnTyrLeuAr 
I M II I II II 1 1 II II I II I II II II II I II II M II I I T I II 1 1 1 II I II I II II II 1 1 
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172 



0 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 



8301 
172 



ac092214.1 

CAGCTTCACCACCATTCTA I I I 1 I CATAGGCAACCACAGAATGTATCAGAACTATTTAAG 8360 



gAsnHi sLeuGl nProTrpAsnvalThrGlyAspserlleArgSerTyrCysGl uLysPh 192 
II I I I I I I I I ! I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 111 I I I I I I I I I I I I I I I I 
8361 GAACCATCTACAACCTTGGAATGTCACTGGCGATAGCATACGGAGCTACTGTGAGAAATT 8420 

192 eTyrLeuPheProLeuLysMetlleThrTrpThrMetProThrAlaValPhePhelleCy 212 
I I I I 1 1 I I I i I I I I I ! I I 1 I I I I I I i I I I 1 I I I I I 11 I I I I I I 1 I 11 I I I I I I I I I I II 

8421 CTATCTCTTCCCTCTAAAAATGATTACTTGGACAATGCCCACTGCTGTC I I I I I CATTTG 8480 



212 
8481 

232 
8541 

252 
8601 



sMetlleLeuLeulleThrSerLeuGlyArgHi sArgLysLysAlaLeuLeuThnhrSe 232 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM IIIIII Mil MM I II I Mill III Mill I 

CATGATTTTGCTCATCACATCTCTGGGAAGACACAGGAAGAAGGCTCTCCTTACAACCTC 8540 

rGlyPheArgGluProServalGlnAlaHi sileLysAlaLeuLeuAlaLeuLeuSerPh 252 
I I I I I I I I III I I I I 1 1 1 1 I I I I I 1 1 I I ! 1 1 1 1 1 i I 1 1 1 1 1 1 1 1 I I I I I I I 1 1 I t ! I 1 1 I 

AGGATTCCGAGAGCCCAGTGTGCAGGCACACATAAAGGCTCTGCTGGCTCTCCTCTCTTT 8600 

eAl aMetLeuPheil eSerTyrPheLeuSerLeuVal PheSerAl aAlaGl yll ePhePr 272 
I II I I I II I II I I I II I I I Ml II I I II I I I I I II I II II I II I I I I 1 I 1 I II I 1 I I I I I 

TGCCATGCTCTTCATCTCATATTTCCTGTCACTGGTGTTCAGTGCTGCAGGTA I I I I I CC 8660 



272 QProLeuAspPheLysPheTrpvalTrpGl userval il eTyrLeuCysAl aAl aval Hi 
I I I I M I I I I I I M I I I I I I I I I 1 M I I M I I : I I M I I I I M I I I M I I M M I M I I 

8661 ACCTCTGGACTTTAAATTCTGGGTGTGGGAGTCAGTGATTTATCTGTGTGCAGCAGTTCA 

292 sProil ell eLeuLeuPheSerAsnCysArgLeuArgAl aval LeuLysSerArgArgSe 
M II I M II II II I II I II II II M II M III 1 1 1 III 1 1 II 1 1 M I II I M I 111 MM 

8721 CCCCATCATTCTGCTCTTCAGCAACTGCAGGCTG AGAGCTGTGCTGAAG AGTCGTCGTTC 

312 rSerArgCysGlyThrPro 318 
M I M 11 II I I II I I I I I I 
8781 CTCAAGGTGTGGGACACCT 8799 



292 
8720 
312 
8780 



RESULT 2 

AC092214/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



AC092214 266224 bp DNA linear HTG 29-JUN-2001 

Homo sapiens chromosome UNK clone RP11-81U9, working draft 
SEQUENCE, 6 unordered pieces. 
AC092214 

AC092214.1 Gl: 14575807 

HTG; HTGS_PHASEl; HTG S_D RAFT; HTGS_FULLTOP . 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 266224) 
Waterston,R.H. 

The sequence of Homo sapiens clone 
Unpublished 

2 (bases 1 to 266224) 
Waterston.R.H. 
Direct Submission 

Submitted (29-JUN-2001) Genome Sequencing Center, Washington 
university School of Medicine, 4444 Forest Park Parkway, St. Louis, 
MO 63108, USA 

[warning] On Sep 15, 2001 this sequence was replaced by 
gi:15625026. 

Genome Center 

Center: Washington university Genome Sequencing Center 
Center code: WUGSC 

web si te : http : //genome . wustl . edu/gsc/i ndex . shtml 
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